A study of the message traffic flow at a Naval Communication 
Station was conducted on two days to determine the percentage of traffic 
following certain paths within the station, and the characteristics of 
those paths with regard to traffic volume, precedence, time in house, 
peak loading periods, and length. 

Incoming and outgoing messages were matched and sorted by 
computer. Analysis revealed that the main flow of traffic was incoming 
on AUTODIN and outgoing on other circuits; approximately 95% of the 
traffic went out on only one type of channel with little duplication within 
type of channel; duplication between terminations and broadcasts was 
minimal; and conflicting results were obtained on variation between 
path characteristics. 

Further testing using the described computer program is recommended
to provide a broader base for conclusions.
I. INTRODUCTION


The purpose of this thesis is to examine the traffic flow within a 
Naval communication station to determine the percentages of traffic 
which moved between the various combinations of input and output 
channels and also to determine if these input-output combinations 
differed with regard to the length of the messages handled, the preced- 
ence of the traffic, the time of day at which the messages arrived, and 
the length of time they remained within the communication station. The 
Naval Electronics Laboratory Center in San Diego expressed the original 
interest in the subject because of their involvement with automation 
projects for Naval Communications. Naval Communication Station,
San Francisco was chosen as the station to be studied because of its
proximity and because of its possible future involvement in automation.


A. BACKGROUND 

In order to provide background and understanding for this study, a 
brief description of the record-copy message processing capability of 
the Naval Communication Station, San Francisco is provided. 

NCS San Francisco is a major communications station located at
Stockton, California. It provides an interface between the ships in the
Eastern Pacific area and the shore establishment. Outgoing traffic to the
ships is carried by the fleet broadcast. Incoming traffic from the ships is
received on the primary ship-shore circuits. In addition, there are six
full-period termination positions which with multiplexing can serve up
to 18 ships on full-period dedicated circuits.

The station is an AUTODIN tributary station off the automatic switching
station at McClellan Air Force Base. The AUTODIN terminal is an
IBM 360-20 multi-media computer with perforated tape, page, and
card capability. Incoming and outgoing Western Union TELEX and TWX
circuits permit delivery of telegrams to and from individuals and civilian
organizations. The Naval Communication Station is the Pacific Ocean
Area processing point for personal telegrams for Class "E" messages
to individuals aboard ship.

The station also has point-to-point "electronic courier" circuits 
with certain subordinate units and other land-based commands. Figure 1 
summarizes the incoming and outgoing circuits of the Naval Communica- 
tions Station, San Francisco. 

Processing of the traffic is partially automated. The messages are
received in the Communication Station as coded punched paper tape at a
"receive" position. The tape is torn off manually and fed to an in-house
Transmitter Distributor (TD) which transmits the message electrically
to the Multiple Address Processing Unit (MAPU). This "soft-wired"
reprogrammable unit reads the Format Line Two routing indicators and
then delivers the message to the proper Basic Reperforator Punching
Elements (BRPE) adjacent to the "send" terminal. The paper tape from
the BRPE is torn off manually and placed on the outgoing TD for
transmission.

Figure 1. NCS San Francisco Circuits


B. REASON FOR THE STUDY

Future concepts concerning Naval communications envision a 
limited number of fully-automated stations. There are at least three 
factors leading toward this automation. First, in order to provide 
money for the Fleet Satellite program, it is necessary to decrease the 
total number of communication stations; therefore, each remaining 
station must assume a greater share of the total traffic load. Second, 
the number of personnel in the Navy is decreasing, thereby reducing 
the number available for manning communications stations. Third, 
with the increased reliance within the Services on automated information 
systems and more centralized control of the operating forces, there is a 
need for greatly increased masses of computerized data to flow at high 
speeds within the communication system. Solutions to these problems 
will require improved automation systems with highly efficient queuing,
polling, and buffering schemes.

The Naval Electronics Laboratory Center (NELC) is engaged in
projects which include research into such improved automated communications
networks for the future. They have stated a requirement for information
concerning the internal flow within communications stations in order
to further such research. NELC felt that much information was available
on each individual terminal such as the number of messages, the times
of arrival, their precedence, length, and so on; but that there was no 
quantitative information in existence on the characteristics of the paths 
within a communication station ("path" being defined as the combination 
of an incoming terminal with an outgoing terminal). Some of the questions 
in which NELC expressed an interest are as follows: What are the various 
paths that the traffic follows within the station? What is the probability 
that each path will be followed; for example, what is the probability that
a message coming in on the AUTODIN terminal will go out on the
Broadcast? Would the characteristics of the paths differ? Would there
be certain paths that had a higher percent of high precedence traffic, or
longer messages? Do the various paths differ with regard to peak loading
time or the time between receipt and delivery? It was felt that information
of this type would lead to the design of more efficient automation
systems for the fully automated communication stations.


C. METHOD OF APPROACH

In order to obtain this information it was decided to take the data 
available concerning messages on each of the incoming channels and 
match it with the corresponding information available on each of the 
outgoing channels. This match was made by comparing the originator 
and date-time-group for each message. Various sampling schemes were
considered. Random sampling of a certain number of messages over a
random sample of days was one method considered, which would have
given a reasonable amount of validity in a statistical sense. However,
the sampling procedures during the day would have had to be done by 
hand because of the low probability of making a match of a message and 
its path if a sample were made of the incoming and outgoing channels 
separately. The procedures would have involved taking samples from 
each of the incoming circuits including all of the various categories of 
incoming channel, precedence, time of day, length; and then tracing 
these messages through the Communication Station. This would have 
been a long, laborious task for the Traffic Analysis Section. It was 
determined that an exhaustive compilation of all the information on
each circuit would require much less effort by station personnel, with
a computer program being used to match the incoming and outgoing 
channels for each message. The exhaustive sample of the traffic would 
provide a true picture of what happened on the days sampled, but would
require further samples to ensure that the days sampled were
representative. A two day sample was selected because of constraints
on time and resources available for this project. The two day sample
would provide an insight into the types of problems that might be en- 
countered and the possible approaches for future investigations. 
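
The matching step described in this section can be illustrated with a short sketch. It is written in Python for readability, not in the COBOL used for the study, and the field names (`ri`, `dtg`, `channel`) and the sample log entries are assumptions made for the illustration only.

```python
from collections import defaultdict

def match_by_originator_and_dtg(incoming, outgoing):
    """Group incoming and outgoing log entries that share (RI, DTG)."""
    paths = defaultdict(lambda: {"in": [], "out": []})
    for rec in incoming:
        paths[(rec["ri"], rec["dtg"])]["in"].append(rec["channel"])
    for rec in outgoing:
        paths[(rec["ri"], rec["dtg"])]["out"].append(rec["channel"])
    return dict(paths)

# Hypothetical log entries, not taken from the station's data.
incoming = [{"ri": "RUWJNA", "dtg": "050400", "channel": "AUTODIN"}]
outgoing = [
    {"ri": "RUWJNA", "dtg": "050400", "channel": "FMAA"},
    {"ri": "RUWJXX", "dtg": "050415", "channel": "FMCC"},
]
matched = match_by_originator_and_dtg(incoming, outgoing)
```

A message whose key appears on only one side ends up with an empty "in" or "out" list, which corresponds to the non-matching traffic discussed later under the assumptions.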

It was decided that one day chosen should be an average day based 
on the total traffic load and the number of full-period terminations in 
use. As a fleet exercise would be in progress within part of the time
available for sampling, it was decided also to choose a day during the
exercise. It was considered that comparing the traffic patterns for
these two days might provide some insight into changes that might be
experienced in heavy traffic emergency operational situations. Unfortunately,
the exercise was less extensive than originally planned
because of the oil cut-backs by the Arabian countries.
II. DATA BASE CONFIGURATION


A. DATA COLLECTION

The raw data was collected by the statistical section of the Com- 
munication Station from the page copies that are retained for a 30-day 
period, or from the message logs. Messages included in the sample 
were those received or transmitted between 0001 and 2400 on the fifth 
and fourteenth of November. The fifth was chosen because it represented 
an average day. There were full period terminations with the Carlisle, 
the Kitty Hawk, the Coral Sea, and the Chicago. The normal broadcast 
channels were up: FMAA for destroyers, FMCC for service forces, 
FMEE for heavy ships such as carriers and cruisers, FCMP for the 
CW Mercast providing weather information, and FSPG which is an on- 
call circuit with the Coast Guard. The fourteenth was chosen because 
it was the day during the fleet exercise, BEL BANGO, with the heaviest 
traffic. Full-period terminations were up with the Chicago, Horne, 
Juneau, Jouett, Halsey, Bainbridge, Coral Sea, Paul Revere, Monticello, 
and Gridley. There was also an extra broadcast channel, XRTT, up for 
exercise traffic. Further information on activities during the exercise
is available in Ref. 3.

The data collected included the incoming (INCHAN) or outgoing 
(OUTCHAN) channel, the originator's routing indicator (RI), the
date-time-group (DTG), precedence (PREC), time of receipt (TOR) or time
of delivery (TOD) as appropriate, and the length of the message. The
length of the message was provided in line blocks as estimated by an 
experienced Radioman Chief. A line block as used here is 80 transmitted
characters and may be made up of several physically printed
lines.


B. PROGRAM

Each incoming and outgoing channel was assigned a number for the 
purposes of computer handling. The numbering system was different 
for the normal and exercise day. Figure 2 is a listing of the number 
assignments. 

The programming was done in COBOL F in several segments because
of problems which needed resolution, and which are discussed further in
the section entitled "Assumptions." The program reads the raw data
from punched cards (input format is contained in the File Description
section of the program) and sorts the data in ascending order by RI, DTG,
channel, and TOR/TOD. This puts all of the transmissions of a message
together with the incoming message first, since the incoming channel
numbers are the lower numbers, and with multiple transmissions over a
single channel in order of transmission. The program then creates a
magnetic tape file consisting of single records (or multiple records for
duplicate transmissions over the channel) for each message. Each
record includes INCHAN, RI, DTG, PREC, TOR, length, TOD for each
channel on which the message was transmitted, and a summary of the
                     CHANNEL NUMBERS

                     Normal Day             Exercise Day
CHANNEL              Incoming   Outgoing    Incoming   Outgoing
AUTODIN              01         21          51         71
Ship-Shore           02                     52
Skaggs Is.           [illegible]
FMAA                            23                     73
FMCC                            24                     74
FMEE                            25                     75
FSPG                            26                     76
FCMP                            [illegible]            [illegible]
[illegible]          [illegible]
Hollister            03
Coral Sea            04         33          54         83
Chicago              05         34          55         84
Kitty Hawk           06         [illegible]
Carlisle Barracks    07         [illegible]
Horne                                       56         85
Juneau                                      57         86
Jouett                                      58         87
Halsey                                      59         88
Bainbridge                                  60         89
Paul Revere                                 61         90
Monticello                                  62         91
Gridley                                     63         92
[illegible]          08                     64         82
Jim Creek                       28                     78
Dixon                           29                     79
Western Union                   30                     80
[illegible]                                            81


Figure 2. Channel Number Assignments
major categories of outgoing channels for each message (which is later
referenced as Variable 26 (VAR026)). The TOD portion of the record
is divided into twenty-five columns, one for each outgoing channel. The
first column is for the AUTODIN, the next 6 for the various broadcast
channels, etc. A blank in a column means that channel was not used.
The summary column consists of a four digit binary code in which the
left-most position indicates whether the message went out on the AUTODIN,
the second digit indicates transmission on a broadcast channel, the
third on a full-period termination and the fourth on the other miscellaneous
channels. A one indicates that the message went out on at least
one of the channels in that category and a zero means that it did not.

Figure 3 illustrates a record from the first file.


01  RUWJNA  050400  PR  0410  13  0425  0424  0420 ......  1100
a   b       c       d   e     f   g                        h

a) Incoming channel number
b) Originator RI
c) DTG
d) Precedence
e) TOR
f) Length in line blocks
g) TOD on outgoing channels shown in columns 1, 3, and 5 of 25 possible
   columns
h) Summary data on major types of outgoing channel, indicating
   transmission on AUTODIN and broadcast.


Figure 3. Sample of a Record from File 1.
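
As a rough illustration of how the four-digit summary code could be derived from the twenty-five TOD columns, consider the Python sketch below. Only the first two column ranges (column 1 for AUTODIN, columns 2 through 7 for broadcast) come from the text; the boundaries used for the full-period termination and miscellaneous columns are assumptions for the example, as is the function name.

```python
# Column ranges (1-based). Only the first two ranges are stated in the text;
# the last two boundaries are assumed purely for illustration.
CATEGORY_COLUMNS = [
    ("AUTODIN", range(1, 2)),
    ("broadcast", range(2, 8)),
    ("full-period termination", range(8, 18)),  # assumed boundary
    ("miscellaneous", range(18, 26)),           # assumed boundary
]

def summary_code(tod_columns):
    """Return the four-digit code for 25 TOD entries ('' means channel unused)."""
    if len(tod_columns) != 25:
        raise ValueError("expected 25 TOD columns")
    digits = []
    for _name, columns in CATEGORY_COLUMNS:
        used = any(tod_columns[c - 1] for c in columns)
        digits.append("1" if used else "0")
    return "".join(digits)

# TODs in columns 1, 3, and 5, as in the sample record of Figure 3.
tods = [""] * 25
tods[0], tods[2], tods[4] = "0425", "0424", "0420"
code = summary_code(tods)  # AUTODIN and broadcast digits are set
```

Keeping the column ranges in one table makes the category boundaries easy to change for a different channel layout, such as the exercise day.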


In addition to creating this file, the program provides a printout of
messages that are not handled by the program and require manual addition
to the file. They are messages with over 11 duplicate transmissions on
an incoming or outgoing channel, and Western Union or locally originated
messages which have duplicate outgoing transmissions.

Some errors in the raw data were detected by inspection. Correc- 
tions were made by making individual changes to certain records in the 
file and by creating additional records and adding them to the file. 

From this first file which had a record for each incoming trans- 
mission, a second file was created with a record for each outgoing 
transmission. Each record includes INCHAN, OUTCHAN, TOR, length, 
PREC, and TOD from File 1, plus the time between receipt and trans- 
mission called time-in-house (TINH). Figure 4 illustrates the records 


that would be created from the sample record in Figure 3. 


0101  0410  15  13  PR  0425
0103  0410  14  13  PR  0424
0105  0410  10  13  PR  0420
a b   c     d   e   f   g

a) Incoming channel number
b) Outgoing column number from File 1
c) TOR
d) TINH
e) Length
f) Precedence
g) TOD


Figure 4. Sample of Records from File 2.
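
The expansion of one File 1 record into File 2 records, including the time-in-house calculation, might look like the following Python sketch. The function names and tuple layout are illustrative assumptions, and the sketch assumes TOR and TOD fall on the same day (no handling of traffic that crosses midnight).

```python
def minutes(hhmm):
    """Convert a four-digit HHMM time into minutes after midnight."""
    return int(hhmm[:2]) * 60 + int(hhmm[2:])

def expand_record(inchan, tor, length, prec, tod_columns):
    """Yield one File 2 style tuple per outgoing column that holds a TOD."""
    for column, tod in enumerate(tod_columns, start=1):
        if not tod:
            continue  # blank column: that channel was not used
        # Same-day assumption; the difference can be negative if clocks disagree.
        tinh = minutes(tod) - minutes(tor)
        yield (inchan, f"{column:02d}", tor, tinh, length, prec, tod)

# The sample record: TOR 0410, TODs in columns 1, 3, and 5.
tods = [""] * 25
tods[0], tods[2], tods[4] = "0425", "0424", "0420"
records = list(expand_record("01", "0410", 13, "PR", tods))
```

For the sample record this produces three records with TINH values of 15, 14, and 10 minutes.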
C. ASSUMPTIONS

Upon initial screening of the output from the initial computer pro- 
grams, it was discovered that several assumptions would be necessary 
to complete the data base. 

1. Duplicates 

It was found that duplicates of individual messages, as indicated 
by duplicate routing indicator and date-time-group, were transmitted 
through the Communications Station. Some were duplicates on the in- 
coming channels with no duplicates on the outgoing channel; some were 
duplicates on the outgoing channel with no incoming message; and some 
were duplicates on both the incoming and outgoing channels. There are 
several possible explanations for these occurrences. 

Multi-section messages are transmitted with the same RI and 
DTG. Communications procedures specify that if a message has over 
100 lines of text, it will be broken and transmitted in sections. Also, 
there were certain short card-punched messages for computer use that 
came through with duplicated RI and DTG. 

Some messages may have been sent more than once and the 
duplicates noticed by the Communication Station in their screening 
procedures. This would account for some duplicates that came in but 
did not go out. Also there were messages that were transmitted at least 
twice on the same outgoing channel while received only once, possibly 


because of requests for retransmission. 
In creating the computer data files for purposes of analysis,
messages that came in and went out as duplicates were matched up on a
first in - first out basis. There were a few situations in which this
procedure was modified because the TOR/TOD's indicated against it.
Those duplicates that were sent out but had no incoming match were
assigned a dummy incoming channel number of 17 on the normal day and
67 on the exercise day. Those duplicates which came in but had no outgoing
match were assigned a dummy outgoing channel number of 99 in
File 2.
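
A simplified Python sketch of this first in - first out pairing follows. It is an illustration under stated assumptions, not the procedure actually coded for the study: records are assumed to be (channel, time) pairs for a single RI and DTG on a single day, the manual exceptions mentioned above are ignored, and the channel numbers in the example are used only as sample values.

```python
from itertools import zip_longest

NO_OUTGOING = "99"  # dummy outgoing channel used in File 2

def pair_duplicates(incoming, outgoing, dummy_incoming="17"):
    """Pair duplicate transmissions of one message first in, first out.

    incoming: list of (channel, TOR); outgoing: list of (channel, TOD).
    Use dummy_incoming="67" for the exercise day.
    """
    ins = sorted(incoming, key=lambda rec: rec[1])
    outs = sorted(outgoing, key=lambda rec: rec[1])
    pairs = []
    for rec_in, rec_out in zip_longest(ins, outs):
        in_chan = rec_in[0] if rec_in else dummy_incoming
        out_chan = rec_out[0] if rec_out else NO_OUTGOING
        pairs.append((in_chan, out_chan))
    return pairs

# One receipt, two transmissions on the same outgoing channel.
pairs = pair_duplicates(
    incoming=[("01", "0410")],
    outgoing=[("23", "0425"), ("23", "0440")],
)
```

Here the second transmission has no incoming partner, so it is paired with the dummy incoming channel 17.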

2. Non-matching messages 

Another situation which required resolution was the large per- 
centage of the traffic which did not match up. It appeared that traffic 
was leaving without coming in or vice-versa. There are several explana- 
tions for this. A small percent of the official traffic does originate or 
terminate at the station and therefore would have no apparent incoming 
or outgoing channel, as the case may be. 

Another category of message that originates and terminates at
the Communication Station is the service message. This is a communicator's
informal tool to request clarification of errors, to request duplicate
transmission, or other procedural information.

Telegrams are also received in and transmitted from the 
station. These telegrams can arrive by two methods: by commercial 


teletype or by telephone. From both these sources the messages are
organized into the proper format for Navy communication channels by the
Communication Station and assigned an originating routing indicator and
DTG, making it appear that they originated at the Communication Station.
Data was provided on the normal day for the "telephoned" messages
which were assigned an incoming channel number of 30. On the exercise
day, data was provided only from the outgoing commercial teletype lines.

Another explanation for the non-match is that because of the 
method of data collection, messages overlapped with the previous or 
following day's messages. Messages that came in before the sample day 
but were transmitted on the sample day would show up as outgoing with
no incoming match. At the end of the day, the reverse would be true;
messages would be received that would be retransmitted on the following
day. 

Messages that originated at the Communication Station could be 
identified by the originator's routing indicator and were assigned a dummy 
incoming channel number of 19 on the normal day and 69 on the exercise 
day with the exception of the previously mentioned normal day's telegrams. 
Messages which had no apparent incoming channel but which did not have 
an originator's RI, were assigned a dummy channel number of 18 or 68. 
Messages that apparently terminated in the Communication Station
appeared in File 1 as having no outgoing channels and were assigned a
dummy channel number of 99 in File 2.

To summarize the channel numbers assigned because of assumptions,
dummy incoming numbers of 17 and 67 indicate outgoing duplicates
with no apparent incoming channel; channel numbers of 18 and 68 indicate
no apparent incoming channel; and channel numbers of 19 and 69 indicate 
messages originated within the Communication Station. All messages 
that appeared to terminate at the Station were assigned an outgoing 
channel number of 99 in File 2. 

3. Precedence and Time-in-House 

Some messages are assigned dual precedence, a higher preced- 
ence for the action addressee than for the information addressee. As it 
could not be determined which of the outgoing channels was the one re- 
ceiving the action traffic and which had the information traffic, only the 
higher precedence was used in precedence analysis. 

The amount of time recorded that a message spends in the Com- 
munications Station is a function of not only the actual time but also the 
accuracy of the clocks and time stamps used to document the time of 
receipt and time of delivery. For this reason there is a small percentage
of traffic which appears to have a negative time-in-house. Also
because of the method of time stamping, the time of delivery is not
totally accurate except on the AUTODIN circuit. On the AUTODIN
circuit the IBM 360-20 automatically prints the TOD on the file copy.
However, in the other circuits the messages are likely to be bulk-processed;
that is, they are time-stamped in groups after they have
been transmitted, rather than singly as they are transmitted. It was
interesting to note that even with these limitations, over 90% of the
traffic for which there was data had been transmitted within 30 minutes
after receipt.
III. ANALYSIS PROCEDURES


The data base was analyzed by computer using the Statistical Pack- 
age for the Social Sciences (SPSS) [Ref. 2]. The SPSS combines many 
of the commonly used statistical techniques in a single program package. 
This package contains one-way frequency distributions and related 
statistics, table displays of relationships between two or more variables 
and related statistics, bivariate correlation analysis, partial correlation, 
multiple-regression analysis, scalogram analysis, and factor analysis. 

Since one of the primary questions of the thesis is "What is the
probability that a message coming in on channel X will go out on
channel Y?", the cross-tabulation subroutine of SPSS which provides
table displays of relationships between two or more variables was the
one chosen. The computer printouts displaying the relationships between
the incoming channel (INCHAN) and the summary data (VAR026) on the
types of outgoing channel, and between INCHAN and the outgoing channel
(OUTCHAN) are given in Appendices A and B.

In performing cross tabulations, the computer takes two variables 
and forms a matrix using the number of cases in each cell. Information 
is printed out by the computer on the number of cases that occurred in 
that cell, the percentage of the row total that number of cases constitutes, 
the percentage of the column total, and the percentage of the total number 


of cases used in the matrix. Using the second cell in the first horizontal
row of the cross tabulation in Appendix B as an example, 232 messages
came in on the AUTODIN channel and went out on at least one broadcast
channel. The 232 messages constituted 23% of all the traffic that came
in on the AUTODIN channel, and 98.7% of the traffic that went out on
the broadcast channels. It accounted for 18.2% of the total traffic during
the day. The figures to the extreme right are the row totals and the percent
of the total traffic included in that row. The figures at the bottom
of the chart represent the column totals and the column percentages.
In the previously mentioned example 79.1% of the traffic came in over
the AUTODIN circuit and 18.4% of the total traffic went out over at least
one broadcast channel. The channels that have ship names are full- 
period terminations with that ship. 
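
The three percentages printed for each cell can be reproduced with a few lines of Python. This is only a sketch of the arithmetic, not the SPSS routine itself. The cell count of 232 is quoted above; the row total of 1008 and grand total of 1274 are the figures quoted in the Chi-square discussion, and the column total of 235 is an assumption inferred from the quoted 98.7% and 18.4% figures.

```python
def cell_percentages(count, row_total, column_total, grand_total):
    """Return (row %, column %, total %) for one cross-tabulation cell."""
    return (
        round(100 * count / row_total, 1),
        round(100 * count / column_total, 1),
        round(100 * count / grand_total, 1),
    )

# 232 messages in the cell; the column total of 235 is an inferred assumption.
row_pct, col_pct, total_pct = cell_percentages(232, 1008, 235, 1274)
```

With these inputs the function returns 23.0, 98.7, and 18.2 percent, in line with the example.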

The second question was "Do these various paths differ with regard
to peak loading time, time between receipt and delivery, precedence of
the traffic and length of the messages?" Various statistical methods
were considered for analysis of these variables, including correlation
analysis and factor analysis. Correlation analysis provides a "single
summary statistic describing the strength of association between two
variables" [Ref. 2]. The correlation procedures in SPSS investigate the
linear correlation among variables which are at least ordinal in scale. 
Since there was no reason to believe that the relationship between the 
path and the time in house, for example, was linear and especially
since the path was not ordinal in nature, this family of procedures was
not used. Factor analysis allows one "to see whether some underlying
pattern of relationships exist such that the data may be 'rearranged'
or 'reduced' to a smaller set of factors or components that may be
taken as source variables accounting for the observed interrelations in
the data" [Ref. 2]. Factor analysis is based on correlation analysis
and so the same objections held as discussed previously. Certain 
relationships among the data could have been analyzed using these 
methods; for example, the relationships among time in house, length, 
time of receipt and precedence. However, since these relationships 
were not primary objectives of this paper and because of time limita- 
tions this direction was not pursued. It was decided to do two-way 
cross tabulations of the various characteristics and these were done 
on the following combinations: Path by TOR, TINH, PREC and length; 
INCHAN by TOR, PREC, and length; and OUTCHAN by TOD. 

Each cell of the cross tabulations in Appendix B represents a path 
within the Communications Station. On the normal day, there were 26 
cells or paths containing entries so all paths were used for analyzing 
TOR, TINH, PREC, and length. On the exercise day, because of 
the large number of cells with entries, 70, it was decided to group the 
cells with fewer entries in order to keep the number of combinations to 
a manageable limit. Those cells with 10 or fewer entries were combined,
and constitute 8.4% of the total traffic on the exercise day. There
were 26 individual paths analyzed, leaving 44 paths grouped as "ten or
less". Figure 7, in the results section, lists the paths analyzed and
percent of traffic associated with each.
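
The grouping of sparsely used paths can be sketched as follows. The Python function and the sample counts are hypothetical; only the threshold of ten entries and the group label come from the text.

```python
def group_small_paths(path_counts, threshold=10, label="ten or less"):
    """Combine paths with `threshold` or fewer messages into one group."""
    grouped = {}
    for path, count in path_counts.items():
        key = path if count > threshold else label
        grouped[key] = grouped.get(key, 0) + count
    return grouped

# Hypothetical message counts per (incoming, outgoing) path.
counts = {
    ("AUTODIN", "FMAA"): 120,
    ("AUTODIN", "Horne"): 10,
    ("Horne", "AUTODIN"): 4,
}
grouped = group_small_paths(counts)
```

In this example the two small paths are merged into a single "ten or less" entry holding 14 messages, while the large path is kept separately.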

Because of the voluminous amount of paper involved, these printouts
have not been included in this thesis but have been forwarded to
NELC.

In order to bring these cross tabulations into some sort of easily 
understood format, graphs of the characteristics were made on total 
traffic on the paths with the larger number of messages involved. The 
graphs to be included here were chosen with three basic criteria in 
mind: large traffic volume, even distribution between exercise and 
normal day traffic, and balance between types of channels (AUTODIN 
incoming and outgoing, broadcast, dedicated channels, and ashore 
channels). Figure 9 in the results section includes ten individual
graphs showing the percentage of traffic received each hour for total
traffic and the following nine paths: AUTODIN-Skaggs Island,
AUTODIN-[illegible], AUTODIN-FCMP, AUTODIN-Coral Sea, Shipshore-AUTODIN,
AUTODIN-Kitty Hawk, Coral Sea-AUTODIN, AUTODIN-Juneau,
and Horne-AUTODIN. Figures 10 through 12 show percentage
patterns for TINH, PREC and length on the same paths just listed. 

It should be noted that as the number of messages per path decreases, 
a small actual-number variation causes large variation in percentage 


and apparently erratic behavior. 
CHI-SQUARE ANALYSIS

Although the visual representations present a simple way of looking
at the material and one can arrive at his own judgement as to whether or
not the patterns are "the same" or "different", it was felt that statistical
analysis would give a more objective determination as to whether the
observed differences were significant. The Chi-square test was used
[Ref. 1].

One application of the Chi-square test is to explore the independence
of two sets of observed events. An example would be the testing of
TOR's to determine whether their pattern varied between the normal
and exercise days. If they are independent (i.e., not affected by the
day), then the patterns will appear similar. If they are independent,
an expected cell probability would be approximately equal to the product 
of its respective row and column probabilities. The data in Appendix A 
is used as an example to explain how the Chi-square is computed 
(although the test was not actually applied to this cross-tabulation for 
reasons explained below). In the example, in order to get the expected 
value of the cell in row 1, column 3, the row total, 1008, is multiplied 
by the column total, 213, and then divided by the cross-tabulation total, 
1274. The expected value for the cell would therefore be 168. The 
Chi-square figure is computed by summing up for all cells the squared
difference between each observed cell value and its expected value
divided by the expected value:

$$\chi^2 = \sum_{\text{cells}} \frac{(\text{Observed value} - \text{Expected value})^2}{\text{Expected value}}$$


The null hypothesis that the observations are independent is tested by 
comparing the computed Chi-square with the appropriate statistical 
table to indicate whether the differences exceed those expected by
chance.

In order to apply the Chi-square test a rule of thumb requires that 
each expected cell count equal or exceed five. Because of the large 
number of vacant cells, the total cross tabulations could not be analyzed 
in this manner; however, the method could be used on totals between the 


exercise and normal days and was so used. 
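
The computation described in this section can be sketched in plain Python as shown below. The function names and the small two-row table are hypothetical and serve only to show the arithmetic; the single-cell expected value uses the row total, column total, and grand total quoted in the text, and evaluates to roughly 168.5 (reported as 168 in the text).

```python
def expected_counts(table):
    """Expected cell counts under independence: row total x column total / N."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand = sum(row_totals)
    return [[r * c / grand for c in col_totals] for r in row_totals]

def chi_square(table, min_expected=5):
    """Return (Chi-square statistic, degrees of freedom) for a count table."""
    expected = expected_counts(table)
    if any(e < min_expected for row in expected for e in row):
        raise ValueError("an expected cell count is below the rule-of-thumb minimum")
    stat = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof

# Expected value for row 1, column 3 of the example in the text.
single_cell = 1008 * 213 / 1274

# Hypothetical counts: rows are the two days, columns are three time periods.
table = [[30, 50, 20], [20, 50, 30]]
stat, dof = chi_square(table)
```

The statistic would then be compared with a Chi-square table at the returned degrees of freedom. The `min_expected` check mirrors the rule of thumb that each expected cell count should be at least five.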
The discussion of results is divided into four sections: a general
overview of the results; a look at the cross tabulations for INCHAN
versus the summary data (VAR026) in Appendix A, and INCHAN versus
OUTCHAN in Appendix B; a look at the charts summarizing the paths
versus the various characteristics; and the Chi-square analysis of
some of the totals of the two days' traffic.


A. GENERAL OVERVIEW 

For a broad overview of what the statistics provided, the table in
Fig. 5 was compiled using the row and column totals of Appendices A
and B. The totals were combined into four broad categories of AUTODIN,
broadcast, dedicated circuits, and miscellaneous circuits. Their
contributions to both the incoming (from Appendix A) and outgoing (from
Appendix B) message totals are displayed in Figure 5. As can be seen,
the general flow of the traffic is into the Fleet Center via the AUTODIN
and out on the other circuits.


B. CROSS TABULATIONS

From Appendix A, it is apparent that only a small percentage of the 
traffic goes out on a combination of types of channels; that is on the 
AUTODIN and also on the broadcast, and so on. By adding the percentages
involved, it was discovered that 97.8 percent of the incoming
messages on a normal day, and 93.6 percent on the exercise day went
out on only one type of channel.


             PERCENT OF INCOMING TRAFFIC    PERCENT OF OUTGOING TRAFFIC
             NORMAL        EXERCISE         NORMAL        EXERCISE
AUTODIN      [illegible]   [illegible]      17.9          18.4
SHIP-SHORE   [illegible]
BROADCAST    4.5           4.2              27.0          38.6
DEDICATED    [illegible]   14.9             16.3          18.3
MISC.        [illegible]   [illegible]      [illegible]   24.8


Figure 5. Incoming and Outgoing Traffic by Broad Category

Since the summary data from Appendix A provides information on 
the number of messages that went out on at least one of the general 
categories of channel, a natural question arising from this data is what 
percentage of the traffic that goes out on one of the channels also goes 
out on another channel of the same type. This information was obtained 
by a by-product of the program which initially sorted the raw data, and 
the results are provided in Figure 6. Only a small percentage of the
traffic goes out on two or more channels of the same type, with the
exception of the broadcast channels where approximately 21% of the
traffic was duplicated.

The first question raised in the creation of this project was "What 
is the probability that a message coming in on Channel X will go out on 
Channel Y?" The answer for the two days involved is provided in 
Appendices A and B. The information is summarized in Figure 7. 
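
The question of the probability that a message arriving on Channel X leaves on Channel Y amounts to computing row percentages of a cross tabulation. The following is a minimal sketch of that calculation, not the program used in the study. It assumes each record is one (incoming channel, outgoing channel) pairing; the function name and the sample records are hypothetical and are not taken from the study's data.

```python
from collections import Counter, defaultdict


def path_probabilities(records):
    """Estimate P(outgoing channel | incoming channel) from (inchan, outchan) pairs."""
    pair_counts = Counter(records)
    row_totals = Counter(inchan for inchan, _ in records)
    table = defaultdict(dict)
    for (inchan, outchan), count in pair_counts.items():
        # Row percentage: share of this incoming channel's traffic on each outgoing channel.
        table[inchan][outchan] = count / row_totals[inchan]
    return dict(table)


# Hypothetical records for illustration only.
sample = [
    ("AUTODIN", "BROADCAST"),
    ("AUTODIN", "BROADCAST"),
    ("AUTODIN", "DEDICATED"),
    ("AUTODIN", "MISC"),
    ("SHIP SHORE", "AUTODIN"),
]
probs = path_probabilities(sample)
print(probs["AUTODIN"]["BROADCAST"])  # 0.5
```

A message sent out on several channels would contribute one record per outgoing channel under this assumption, so the row totals count transmissions rather than distinct messages.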
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Figure 6. Multiple Transmissions Within Categories 
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Figure 7. Tabulation of Paths and Loads 
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Another grouping of the information that is of particular interest 
shows the amount of traffic that goes out on both the dedicated and the 
broadcast circuits. One of the purposes of bringing a ship up on a 
full-period termination is to keep the broadcast circuits from being 
overloaded. It can be seen from the combinations in Appendix A containing 
both "B" and "D" that few messages are in fact transmitted over that 
combination, and so the dedicated circuits are cutting down on the 
broadcast loading problems. On the normal day 249 messages were 
transmitted over the broadcast channels (singly or in combination), 235 were 
transmitted over the dedicated channels and only 5, or 2% of the 
broadcast traffic, were transmitted over both the broadcast and dedicated 
channels. On the exercise day, 478 messages were transmitted over 
the broadcast channels, 247 went out on the dedicated channels, while 
42, or 8.8% of the broadcast traffic, went out on a combination of the 
broadcast and dedicated channels. 
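
The overlap percentages quoted above can be checked directly from the message counts given in the text. The short sketch below does only that arithmetic; the function name is hypothetical.

```python
def overlap_percent(both, broadcast_total):
    """Percentage of broadcast traffic that was also sent on dedicated channels."""
    return 100.0 * both / broadcast_total


# Counts quoted in the text: messages on both channel types / messages on broadcast.
normal = overlap_percent(5, 249)
exercise = overlap_percent(42, 478)
print(f"normal day: {normal:.1f}%, exercise day: {exercise:.1f}%")
```

Rounded to one decimal place, the two results agree with the 2% and 8.8% figures stated above.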

It is also interesting to note that only a very small percentage of 
the traffic, 0.6% on the normal day and 2.1% on the exercise day, 
passed through the Fleet Center without in some way involving the 
AUTODIN circuit, either incoming or outgoing, singly or in combination 
with one of the other types of circuits. 

The cells from Appendix A are listed below in Figure 8 in order of 
the amount of traffic they contain. Those cells with fewer than 10 
messages are not included. They make up 2.6% of the traffic on the normal 
day and 9.7% on the exercise day. 



                 SUMMARY               % OF TOTAL TRAFFIC 
INCHAN           (VARO26)              NORMAL       EXERCISE 
AUTODIN          Miscellaneous         43.1         28.8 
AUTODIN          Broadcast             [illegible]  [illegible] 
AUTODIN          Dedicated             16.6         16.6 
SKAGGS ISLAND    AUTODIN               5.6 
CORAL SEA        AUTODIN               4.9 
WESTERN UNION    AUTODIN               4.7 
SHIP SHORE       AUTODIN               4.2          4.0 
JUNEAU           AUTODIN               2.6 
CARLISLE         AUTODIN               [illegible] 
HORNE            AUTODIN               [illegible] 
AUTODIN          Broadcast-Dedicated   [illegible] 
KITTY HAWK       AUTODIN               [illegible] 
JOUETT           AUTODIN               1.4 
HALSEY           AUTODIN               1.0 
CARLISLE         AUTODIN-Dedicated     0.9 
[illegible]      AUTODIN               [illegible] 


Figure 8. Summary of Major Traffic Distribution 


The second set of cross tabulations, INCHAN by OUTCHAN, 
is presented in Appendix B. The patterns are very similar to those 
shown in the INCHAN by VARO26, as would be expected. There are two 
items that require further explanation. Two messages on the normal 
day and six on the exercise day came in on the AUTODIN channel and 
went out on the AUTODIN channel. This may have been caused by 
messages that were misrouted to the station and, when discovered, were 
corrected and re-entered into the AUTODIN system. The thirteen 
messages that were received from and sent out to the Carlisle on the 
normal day were part of a communications test with that ship. 
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C. PATHS VERSUS VARIOUS CHARACTERISTICS 

Looking at the four charts of the totals in Figures 9 through 12, the 
daily patterns are similar for length, TINH and precedence. The TOR 
patterns are more subtle; however, there is a general tendency toward 
a bi-modal pattern with the heavier traffic late at night and early in the 
morning. 

1. Time of Receipt 

The patterns for TOR, Figure 9, show wide variations between 
the days on the individual paths, even though the patterns for the totals 
appear similar. Also, the variation between days on the same path 
appears as great as the variation between paths. 

2. Time in House 

For TINH, the visual representations in Figure 10 clearly 
show a very similar handling of the total traffic on the two days. The 
most noticeable difference occurred in the AUTODIN-FCMP path, with its 
modal point being 1-3 hours, vice the 0-15 minutes for the other paths. 
This follows logically because the FCMP broadcast is transmitted 
at scheduled intervals, and the incoming traffic must await those 
intervals. The paths that include the AUTODIN as the outgoing channel 
show a different pattern, indicating a faster handling time if the traffic 
is flowing toward the AUTODIN terminal than if it flows away. This 
could be caused by the method of bulk processing the TOD stamp on 
the outgoing broadcast copies vice the automatic stamping of the TOD 
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on the AUTODIN circuit; by the slower transmission speeds of the 
100 word per minute teletypes vice the 1500 word per minute IBM-360-20; 
or by the smaller amount of traffic flowing toward the AUTODIN terminal 
allowing for less delay due to handling. There is some visual evidence 
for believing that variation between paths exists. 

3. Precedence 

In the graphs of precedence, Figure 11, there is a general 
pattern which shows a low number of Flash messages received, a slight 
increase in Immediate messages, a steep increase in Priority messages, 
and then a slight decrease in the number of Routine messages. The 
balance between Routine and Priority messages is closer on the exercise 
day, whereas it would be assumed that there would be greater amounts 
of higher precedence traffic on the exercise day. The AUTODIN- 
Skaggs circuit with its higher percentage of Immediate messages shows 
the only major alteration to the general pattern. 

4. Length 

The original computer runs on path by length were made with 
length ranging from zero to two hundred line blocks and grouped by each 
ten line blocks as shown in Figure 12. Visual inspection of the numbers 
in the cells indicated that they would all follow the same general pattern 
with 80-100% of the messages in the cells being between one and ten 
line blocks long. It was decided to regroup the messages by each two 
line blocks up to twenty and to group all those greater than twenty into 
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one category. The over-twenty group comprised only 10.2% of the 
total traffic. Six to eight line blocks seem to be the most "popular" 
sizes. The variation between the paths appears to be greater than the 
variation between days for each path. 
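
The regrouping described above (two-line-block bins up to twenty, with everything longer in one category) might be sketched as follows. This is an illustrative reconstruction rather than the original program. The bin boundaries (1-2, 3-4, ..., 19-20) and the treatment of lengths below one line block are assumptions, and the sample lengths are hypothetical.

```python
from collections import Counter


def length_bin(line_blocks):
    """Return the bin label for a message length given in line blocks."""
    if line_blocks < 1:
        # Assumption: every message occupies at least one line block.
        raise ValueError("length must be at least one line block")
    if line_blocks > 20:
        return "over 20"
    upper = line_blocks + (line_blocks % 2)  # round up to the next even number
    return f"{upper - 1}-{upper}"


def length_distribution(lengths):
    """Percentage of messages falling in each length bin."""
    counts = Counter(length_bin(n) for n in lengths)
    total = len(lengths)
    return {label: 100.0 * count / total for label, count in counts.items()}


# Hypothetical message lengths, for illustration only.
sample_lengths = [3, 6, 7, 8, 8, 12, 20, 21, 45, 7]
dist = length_distribution(sample_lengths)
print(dist["7-8"], dist["over 20"])  # 40.0 20.0
```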


D. CHI-SQUARE ANALYSIS 

The Chi-square tests were used to give an objective basis for 
determining if the observed patterns were significantly different. 

An informal survey of opinions on which of the four graphs for 
total traffic, Figures 9 through 12, indicate the "most similarity 
between days," resulted in agreement that TINH patterns were the 
most similar and TOR the most dissimilar. There was disagreement 
on ranking PREC and length. 

However, when Chi-square tests were run on the total traffic, the 
null hypothesis (that the distribution is the same on both days) was 
rejected at the .005 level of significance only for precedence and 
length. The differences for TOR and TINH could be 
expected by chance. The summary of the tests is contained in Figure 13. 
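
A between-days comparison of this kind can be sketched as a Pearson Chi-square test on a table with two rows (one per day) and one column per category. The code below is a minimal illustration under that assumption, not the packaged program used in the study. The function name and the counts are hypothetical. The critical values are standard table values for the upper .005 tail.

```python
def chi_square_two_days(day1, day2):
    """Return (statistic, degrees of freedom, smallest expected count) for a 2 x k table."""
    if len(day1) != len(day2):
        raise ValueError("both days must use the same categories")
    n1, n2 = sum(day1), sum(day2)
    grand_total = n1 + n2
    statistic = 0.0
    smallest_expected = float("inf")
    for a, b in zip(day1, day2):
        column_total = a + b
        if column_total == 0:
            raise ValueError("a category with no messages on either day must be dropped")
        for observed, row_total in ((a, n1), (b, n2)):
            expected = row_total * column_total / grand_total
            smallest_expected = min(smallest_expected, expected)
            statistic += (observed - expected) ** 2 / expected
    return statistic, len(day1) - 1, smallest_expected


# Upper .005 critical values of the Chi-square distribution, keyed by degrees of freedom.
CRITICAL_005 = {1: 7.879, 2: 10.597, 3: 12.838, 4: 14.860}

# Hypothetical counts by precedence (Flash, Immediate, Priority, Routine).
normal_day = [5, 120, 450, 350]
exercise_day = [8, 150, 520, 560]
stat, df, smallest = chi_square_two_days(normal_day, exercise_day)
print(f"chi-square = {stat:.2f}, df = {df}, reject = {stat > CRITICAL_005[df]}")
```

The smallest expected count is returned so that the usual precondition for the test, an adequate expected value in every cell, can be checked before the result is interpreted.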

This would indicate that the pattern for TINH and TOR is general 
and might be expected to appear every day, whereas it is probable that 
precedence and length are affected by days sampled and could be 
expected to vary. 

The visual observation and statistical analysis disagree, especially 
with regard to TOR, which visual observation would indicate to have 
dissimilar patterns, whereas the Chi-square test finds no significant 
difference between the days at the .005 level. 
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Figure 13. Summary of Chi-square Analysis 
of Total Traffic Between Days 

Visual inspection of the various charts within each characteristic 
grouping would appear to show that there is greater variation between 
the days on individual paths than on the total traffic. Chi-square tests 
were run on paths that allowed use of this test (because of the 
requirement of having an expected value of at least five in each category). 
Analysis was done on the AUTODIN-Skaggs Island path for the TOR; and 
the AUTODIN-Skaggs, AUTODIN-FMCC, AUTODIN-Coral Sea, and 
Ship Shore-AUTODIN paths for precedence. Figure 14 presents the 
results. 


Again it was discovered that visual inspection and statistical analysis 
do not necessarily agree. Using the precedence graphs, Figure 11, the 
variation for the total messages was sufficient to reject the null 
hypothesis. Two of the paths, AUTODIN-FMCC and Ship Shore- 
AUTODIN, which appear to be more widely varying, however, do not 
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have sufficient statistical variation to indicate that the day sampled 
affects the precedence distribution. 
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Figure 14. Summary of Chi-square Analysis 
of Various Paths Between Days 

The difference between the Chi-square results and visual observation 
in comparing total traffic patterns on individual paths may be 
caused by small variations appearing more extreme when a small 
number of cases is plotted as percentages. Consequently, the smaller 
the number of cases being plotted, the larger the visual variation will 
have to be in order to be statistically significant. 
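
The effect of sample size can be illustrated with a small worked example. In the sketch below, two hypothetical paths show the same percentage split between days (40/60 on one day, 60/40 on the other), but one path carries ten messages per day and the other one hundred. The counts and the function name are assumptions made for illustration only.

```python
def chi_square_2xk(day1, day2):
    """Pearson Chi-square statistic for two rows of category counts."""
    n1, n2 = sum(day1), sum(day2)
    grand_total = n1 + n2
    statistic = 0.0
    for a, b in zip(day1, day2):
        column_total = a + b
        for observed, row_total in ((a, n1), (b, n2)):
            expected = row_total * column_total / grand_total
            statistic += (observed - expected) ** 2 / expected
    return statistic


# Same percentages on both paths, different message counts (hypothetical).
small_path = chi_square_2xk([4, 6], [6, 4])        # 10 messages per day
large_path = chi_square_2xk([40, 60], [60, 40])    # 100 messages per day

CRITICAL_005_DF1 = 7.879  # standard table value, one degree of freedom
print(small_path > CRITICAL_005_DF1, large_path > CRITICAL_005_DF1)  # False True
```

Plotted as percentages, the two paths would look identical, yet only the heavier path produces a statistic above the .005 critical value.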
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V. SUMMARY/CONCLUSIONS 


This thesis was undertaken to provide understanding of the traffic 
flow within the NCS Stockton from the incoming circuits to the outgoing 
circuits, to determine the percentage of traffic that flowed over the 
various paths (a combination of an incoming and outgoing channel) within 
the Station, and to determine if these paths differed with respect to the 
characteristics of time of receipt, time in house, precedence or length. 

A computer program was developed which takes two days of traffic 
(a normal day and an exercise day) and combines the data from incoming 
and outgoing messages to form records of the path or paths used by each 
message. These records were analyzed using a packaged computer 
program for statistical analysis. 

It was found that the main flow of traffic is from the incoming 
AUTODIN to the other outgoing circuits. Approximately 95% of the 
traffic on the two days went out on only one type of channel (AUTODIN, 
broadcast, dedicated terminations or miscellaneous) and, except for 
the broadcast channels, which had approximately 21% duplication, there 
was little duplication within type of channel. Duplication between 
dedicated termination and broadcast channels amounted to 2% and 8.8% 
of the traffic on the broadcast channels on the normal and exercise 
days, respectively. 

The percentage of traffic that passed over each of the paths was 
segregated by the computer program and presented in matrix form in 
Appendices A and B. 
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The traffic characteristics (TOR, time in house, precedence and 
length) for total daily traffic showed similarities between days when 
viewed graphically. Statistical tests, however, indicated that the 
variations for precedence and length were greater than would be expected by 
chance, indicating that these characteristics do not have a pattern that 
is consistent from day to day. Proportionately more low precedence 
traffic was sent on the exercise day than on the normal day, which is 
not what would be expected by intuition. 

Concerning the question of variations between paths, the graphs 
showed that on the two days involved, the variation in TOR between days 
for each path was as great as the variation between paths on a single 
day. For TINH, visual evidence exists that there was a consistent 
variation between paths. Indications for precedence are that there is 
little variation between paths except for the AUTODIN-Skaggs Island 
path. The variation in length between paths appears to be greater than 
the variation between days for each path. 

Due to the conflict between the evidence presented by the graphs 
and the statistics, further research is recommended covering several days 
over a wider period of time. This would provide a wider base for 
conclusions and could be accomplished with careful assignment of channel 
numbers and minor modification to the computer programs created for 
this research. 
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APPENDIX A 


Cross Tabulation of INCHAN by Summary Data 
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